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Abstract 


Homogenization is a powerful way of taming a class of finite structures with 
several interesting applications in different areas, from Ramsey theory in 
combinatorics to constraint satisfaction problems (CSPs) in computer sci- 
ence, through (finite) model theory. A few sufficient conditions for a class of 
finite structures to allow homogenization are known, and here we provide a 
necessary condition. This lets us show that certain natural classes are not 
homogenizable: 1) the class of locally consistent systems of linear equations 
over the two-element field or any finite Abelian group, and 2) the class of 
finite structures that forbid homomorphisms from a specific MSO-definable 
class of structures of treewidth two. In combination with known results, 
the first example shows that, up to pp-interpretability, the CSPs that are 
solvable by local consistency methods are distinguished from the rest by the 
fact that their classes of locally consistent instances are homogenizable. The 
second example shows that, for MSO-definable classes of forbidden patterns, 
treewidth one versus two is the dividing line to homogenizability. 


1 Introduction 


A relational structure with a countable domain is called homogeneous if it is highly 
symmetric: any isomorphism between any two of its finite induced substructures 
extends to an automorphism of the whole structure. In many areas of combi- 
natorics, logic, discrete geometry, and computer science, homogeneous structures 
abound, often in the form of nicely behaved limit objects for classes of finite struc- 
tures. Typical examples include the Rado graph R, which can be seen as the limit 
of the class of all finite graphs; the linear order of the rational numbers Q, seen as 


*A shorter preliminary version of this paper was published in Proceedings of 25th EACSL 
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the limit of all finite linear orders; or the countable Urysohn space U, the limit of 
all rational metric spaces. The literature on the subject is very extensive; we refer 
the reader to for a recent survey. 

Homogeneous structures arise as limits of well-behaved classes of finite struc- 
tures in a way made precise by Fraissé’s theorem, which describes them combina- 
torially in a finitary manner. The theorem states that a homogeneous structure 
is characterized, up to isomorphism, by its age, i.e., the class of its finite induced 
substructures. Moreover, classes of finite structures arising as ages of homogeneous 
structures are precisely Fraissé classes, i.e., classes closed under taking induced sub- 
structures and under amalgamation — a form of glueing pairs of structures along 
a common induced substructure (see and Section [2] for precise definitions). 

Thanks to Fraissé’s theorem, combinatorial arguments involving finite struc- 
tures can often be replaced by, or aided by, arguments involving highly symmetric, 
infinite structures. In combinatorics, for example, homogeneous structures appear 
unavoidably in structural Ramsey theory [19]. At the intersection between com- 
binatorics and computer science, homogeneous structures appear in the theory of 
logical limit laws for various models of random graphs [16]. In computer science 
proper, homogeneous structures appear in the theory of constraint satisfaction 
problems [5], and automata theory [6], and verification [7]. 

One of the advantages of working with homogeneous structures, rather than 
classes of finite structures, is that their automorphism groups are very rich. For 
example, over a finite relational signature, the homogeneity of the structure im- 
mediately implies that, up to automorphism, it has finitely many elements, pairs 
of elements, triples, etc. In model theoretic terms, this means that the structure is 
w-categorical by the classical Ryll-Nardzewski theorem, and its first-order theory 
admits elimination of quantifiers. In turn, since in any such structure there are 
only finitely many first-order definable relations of each arity, homogeneous struc- 
tures over finite relational signatures are, in a strong technical way, close to being 
finite. 

Thus, with Fraissé’s theorem in hand and the many applications of homo- 
geneous structures in mind, it becomes quite important a task to identify more 
Fraissé classes. More generally, one would like to identify classes of finite struc- 
tures that are perhaps not Fraissé classes themselves, but appear as reducts of some 
Fraissé class over a richer yet finite signature. Such classes of finite structures are 
called homogenizable [9]. The point in case is that the lifted Fraissé class can be 
thought of as taming its reduct by providing a homogeneous structure that plays 
the role of limit object for it. Many of the application examples mentioned above 
do actually go through lifted Fraissé classes and their corresponding homogeneous 
Fraissé limits. 

A noticeable amount of work has gone into providing sufficient conditions for 


a class of finite structures to be homogenizable. Instances include the model- 
theoretic methods of Covington [9], and the combinatorial explicit constructions of 
Hubička and NeSetiil [14]. Here we provide a combinatorial necessary condition for 
homogenizability (Theorem. 1]in Section [3). This allows us to prove that certain 
natural classes of finite structures previously considered in the literature are not 
homogenizable. Since it is known that every Ramsey class is homogenizable [18], 
our result may also be relevent in the context of NeSetril’s classification programme 
of Ramsey classes. See and the introduction of the recent survey for more 
on this. 

Our first example of a non-homogenizable class comes from the theory of con- 
straint satisfaction problems (CSPs). We show that the class of locally consistent 
systems of linear equations over the two-element field is not homogenizable. More 
generally, the result holds for systems of equations over any finite Abelian group. 
This answers a question first raised by the first author of this paper in 2]. Pre- 
cisely, by a locally consistent system of equations we mean one whose satisfiability 
cannot be refuted by the (j,k)-consistency algorithm for small j and k, which 
is a well-studied heuristic algorithm for solving CSPs. Moreover, in combination 
with the resolution of the Bounded Width Conjecture by Barto and Kozik [4], this 
shows that the constraint languages whose classes of locally consistent instances 
are homogenizable are, up to pp-interpretability, precisely those that are solvable 
by local consistency methods. All this is worked out in Section 

In Section[5}we give a second example of a non-homogenizable class that, in this 
case, is motivated by the works of Hubička and Nešetřil [I4], and Erdés, Tardif, 
and Tardos [10]. It was shown in that every class of finite structures that is 
of the form Forb,(F), where F is a regular class of connected finite structures, is 
homogenizable. In words, Forb;,(F) is the class of finite structures that do not ad- 
mit homomorphisms from any structure in F. The notion of regularity considered 
in [14] is closely related to the notion of regularity in automata theory, and agrees 
with it on coloured paths and trees. However, our second example shows that 
even if F is MSO-definable and has maximum treewidth two, the class Forb;,(F) 
need not be homogenizable. Since MSO-definability coincides with automatic reg- 
ularity for coloured paths and trees, this shows that for MSO-definable classes, 
treewidth one versus two of the forbidden structures in F is the dividing line to 
homogenizability. 


2 Preliminaries 
Signatures, structures, reducts, and expansions. A relational signature X 


is a set of relation symbols R,, Ro,..., each with an associated natural number 
called its arity. In this paper, we consider only finite relational signatures. A 


b-structure A = (A; R$, R$, ...) is composed of a set A, called its domain, and 
a relation RA C A* on A for each R in X, where k is the arity of R. We say 
that R^ is the interpretation of R in A. We write |A| to denote the cardinality of 
the domain of A. A %-structure is sometimes referred to as a structure over the 
signature X. If X% is a signature that contains © and At is a ©*-structure, then 
the “-reduct of At is the structure A obtained from At by forgetting all relations 
from ©* — È. In this case, we also say that At is an expansion of A. Expansions 
and reducts are also called lifts and shadows, respectively. 


Substructures, homomorphisms, and embeddings. If A is a structure 
and X is a subset of its domain A, we write A[X] for the substructure of A induced 
by X, that is, the U-structure with domain X in which each relation symbol R in 
X is interpreted by RAM X*, where k is the arity of R. 

Let A and B be structures over the same relational signature X. Let A and B 
denote their domains. A homomorphism from A to B is a mapping f : A > B 
for which the inclusion f(R’) C RP holds for every R in ©. The homomorphism 
is strong if in addition the inclusion f(A* — R^) C B* — RP holds for every R 
in X, where k is the arity of R. A monomorphism from A to B is an injective 
homomorphism. Whenever A is a subset of B and the inclusion mapping A > B 
is a monomorphism, we say that A is a substructure of B. An embedding from 
A to B is an injective strong homomorphism. Whenever A is a subset of B and 
the inclusion mapping A — B is an embedding, we say that A is an induced 
substructure of B. An isomorphism from A to B is a surjective embedding. If there 
is an isomorphism from A to B we say that the two structures are isomorphic. If 
f: A — B is a partial mapping with domain X C A and image Y C B, we say 
that f is a partial homomorphism from A to B if it is a homomorphism from A[X] 
to B[Y]. We write (7) to denote the set of all embeddings from A to B. Sometimes 
we write f : A — B to mean that f is a mapping from the domain of A to the 
domain of B. 


Amalgamation. If B and C are /-structures with domains B and C, we write 
BUC for their union, i.e. the 4-structure with domain BUC and relations RYC = 
RE U R? for every R in X. Let f and g be embeddings from the same structure 
A into structures B and C, respectively. The structure D is an amalgam of B and 
C through f and g if there exist embeddings f’ and g’ from B to D and C to D, 
respectively, such that the diagram in Figure [1] commutes, ie, T Of ja og, 
We say that D is a strong amalgam if f’(B)Ng'(C) = (f'o fA) = (g'o g)(A), 
where A, B and C denote the domains of A, B and C, respectively. We say that 
D is a free amalgam if it is strong and, additionally, D = D[f’(B)] U D[g'(C)]. We 
also say that D is the union of B and C amalgamated along A through f and g via 


Yy 


FIGURE 1: Amalgamation of B and C through f and g. All mappings are embeddings. 


f' and g'. Note that the free amalgam of B and C through f and g is uniquely 
defined up to isomorphism, and is isomorphic to the disjoint union of B and C, 
quotiented by the equivalence relation identifying f(x) with g(x), for x in A. We 
denote this free amalgam f Ua g. When f and g are implicit, we denote it BU, C. 
We also say that B and C are glued along A. 


Classes of structures. All our structures will have finite or countably infinite 
domain. Moreover we assume that all structures have a domain that is a subset 
of a common background countable set, say N. For a fixed signature X}, a class of 
structures is a set of structures that is closed under isomorphisms, i.e. if A and B are 
isomorphic structures and A belongs to the class, then B also belongs to the class. 
A class of structures C is closed under amalgamation if for every two embeddings 
f and g from the same structure A in C into structures B and C in C, there exists 
in C an amalgam of B and C through f and g. A class of finite structures is an 
amalgamation class, also called a Fraissé class, if it is closed under taking induced 
substructures and amalgamation. For example, the class of all finite graphs is an 
amalgamation class — in fact, it is closed under free amalgamation — so is the class 
of all finite digraphs. The class of all finite linear orders is also an amalgamation 
class, although it is not closed under free amalgamation. Fraissé’s theorem states 
that a class is Fraissé if and only if it is the class of finite induced substructures 
of a homogeneous structure. 

For two signatures © and £* with the second containing the first, if C and C+ 
are classes of U-structures and t-structures, respectively, then we say that C is 
the S-reduct of Ct if C is the class of -reducts of the structures in CT. 


Homogenizable classes. We say that a class of “-structures is homogenizable 
if there is a signature =* extending ©, and an amalgamation class Ct of X+- 
structures, such that C is the X-reduct of Ct. For a class of -structures F, 
let Forb,(F) denote the class of all finite U-structures A such that for no F in 
F there is a homomorphism from F to A. Hubička and NeSetiil define a notion 
of regularity, which we call HN-regularity (we omit its technical definition), and 
prove in Theorem 3.1 from [14] that if F is a HN-regular class of finite connected 
structures, then Forb,(F) is homogenizable. In particular, if F is finite, then 
Forb;(F) is homogenizable. 


Example 2.1. Let X be the signature that consists of one binary predicate E 
and two unary predicates S and T. Let P,, denote a simple directed E-path 
with n nodes from a unique S-colored node to a unique 7-colored node. The class 
F = {Pa : n > 1} is HN-regular, and therefore, by [L4], the class Forb,(F) is 
homogenizable. It consists of digraphs whose nodes are possibly labeled with S 
or T, and there is no directed path from an S-labeled node to a T-labeled node. 
We show that Forb,(F) is homogenizable by a direct construction. Let X+ be 
the extension of © by two unary predicates J and O. Let Ct consist of all X+- 
structures At such that the domain of At is partitioned into J^" and O^, and 
that SAt C I^, T C O^ , and there are no E-edges starting in J^ and ending 
in O^. Then C+ is an amalgamation class, as it is closed under free amalgamation. 
The class Forb,(F) is the -reduct of C*: a structure A in Forb,(7) expands to 
a structure At in Ct, in which J^" is the set of vertices reachable from S^ by a 
directed E-path, and O^ is its complement. 


3 Necessary condition for homogenizability 


Fix a finite relational signature X. Except for the examples, in this section all 
structures are over this signature, or over a signature X+ that extends X. Before 
we state the necessary condition for homogenizability we need some notation and 
terminology. 

Let C be a class of finite structures. If A, L and R are structures in C, and 
L: A —> L and R: A > R are embeddings such that no amalgam of L and R 
through L and R is in C, then we say that L : A> L, R: A > R is a diagram 
that witnesses failure of amalgamation of C. We illustrate the definitions with a 
running example. 

Example 3.1 (Running example). Let F,, denote the structure depicted in Figure[2| 
with n vertices in the middle column. The signature © of this structure consists of 
one binary predicate E, the undirected edges, one binary predicate E , the vertical 
directed edges, and four unary predicates R (for red), B (for blue), S (for source), 


FIGURE 2: Forbidden structure Fn. 


and T (for target), each appearing in the structure exactly once. Observe that the 
colours © and T ensure that F is an antichain in the homomorphism pre-order, 
i.e. there are no homomorphisms from F,, to Fm ifn # m. Let C = Forb,(F). In 
the running example, we will demonstrate that the class C is not homogenizable. 

Choose a large natural number n. Let L denote the left part of the structure Fn 
obtained by removing the blue vertex (labeled B). Symmetrically, let R denote the 
right part of F,, obtained by removing the red vertex (labeled R). Let A denote 
the intersection of L and R, i.e., the E-path with n vertices starting at the S- 
labeled vertex and ending at the T-labeled vertex. Let L: A —> Land R: A —> R 
be the inclusion mappings. Then any amalgamation of L and R necessarily is a 
homomorphic image of F,. Hence L : A > L, R: A > R is a diagram that 
witnesses failure of amalgamation of C. 


Let L: A > L, R: A —> R bea diagram that witnesses failure of amalgamation 
of C. For a structure J and a partial mapping C : a — {L, R}, let J? be the 
structure that is obtained by glueing to J, for each 7 in Dom(C), a fresh copy of 
either L or R depending on whether C (r) = L or C(t) = R. More formally, J© is 
defined by induction on the cardinality of the domain of C: if Dom(C) = Ø, then 
J? = J; otherwise, if C = C’ U {n 4 o}, where m € (3) and a € {L, R}, then 
define J? = 7’ Ua o, where m : A => J7 is m : A— J composed with the identity 
embedding from J to J”. 

For a natural number m and a &-structure A with domain A, let A®m denote 
the structure with domain A x [m] in which the interpretation of a relation R in X 
of arity k is the set of all tuples ((a1, 71), (a2, i2),..., (ak, ip)) where (a1,..., ap) € 
R^ and ii,...,i% € [m]. Observe that every function f : A — [m] induces an 
embedding a; : A > A Q m, defined by m(a) = (a, f(a)). Let Eam denote the set 
of all embeddings of the form rẹ for f : A — [m]. In particular, €,,, is a subset 


of qe containing exactly m!4! embeddings. 
A diagram L: A —> L, R: A-— Ris confusing for C if the following conditions 
hold: 


1. it witnesses failure of amalgamation of C, and 


2. for every natural number m, if J = A@m, then for every coloring C : Egm ~> 
{L, R} the structure J? belongs to the class C. 


Its order is the cardinality of the domain of A. 

Theorem 3.1 If C is a homogenizable class of finite structures, then there exists 

a natural number r such that every confusing diagram for C has order at most r. 
This theorem is the main technical result of this paper. Before we prove it, we 

illustrate it by applying it to our running example. 


Example 3.2. Fix natural numbers m and n. Let L : A > L and R: A —> R be 
defined as in Example The structure J = A & m is depicted in Figure 
Its domain is [n] x [m], and every element (i,j) € [n] x [m] with i < n — 1 is 


FIGURE 3: The structure AQ m, with an embedding T € Ehm. 


connected by an E-edge to every element (i+ 1, k) € [n] x [m]. The embeddings 
Eam correspond to functions f : [n] > [m]. If C : Eam > {L, R} is a coloring, 
then J? is obtained by considering all functions f : [n] + [m], and connecting 
every vertex along the path {(i, f(i)):1 <i < n} to a fresh vertex which is red 
if C(f) = L, and blue if C(f) = R. Observe that no structure F in F maps 
homomorphically to J”. Therefore, J? belongs to C = Forb;(F). Since m is 
arbitrary, this shows that the diagram L : A > L, R : A —> R is confusing for C. 
Since its order is |A| = n, and n is arbitrary, Theorem implies that C is not a 
reduct of any amalgamation class. 


Theorem [3.1] follows easily from Lemma stated below. 

Let L: A> L, R: A —> R witness failure of amalgamation of C. An (L, R)- 
confusion for C is a structure J in C, together with a set E€ C ver such that J© is in 
C for every coloring C : E > {L, R}. For E C (2) and a natural number r bounded 
by the cardinality of the domain of A, let €, denote the set of all restrictions 7| x 
of m in E, where X ranges over all r-element subsets of the domain of A. 


Lemma 3.2 Letr and t be natural numbers, and let C be a class of X-structures. 
There exist numbers p and q (depending on r and t only) such that the following 
condition implies that C is not a reduct of any amalgamation class over a signature 
with at most t predicates of arity at most r: 


there is a diagram L : A> L, R : A> R that witnesses failure of 
amalgamation of C and of order at least r, and there is an (L, R)- 
confusion (J, E) for C satisfying 


E| > p- lE] +h). (1) 


Before we prove Lemma\3.2| we show how Theorem [8.1] follows from it. 


Proof of Theorem [8.1 Suppose that C has confusing diagrams of arbitrarily large 
order. For every two fixed natural numbers r and t, we apply Lemma to 
conclude that C is not a reduct of an amalgamation class over a signature with 
t symbols of arity at most r. Let p and q be as in the statement of the lemma. 
Consider a confusing diagram L : A > L, R : A > R and let n be its order. 
Fix a natural number m, and let J = A & m and E = Eam. Then (J,€) is an 
(L, R)-confusion for C, by the definition of confusing diagram, and |E| = m” and 
|E | = m”. Since the order n of the diagram can be chosen arbitrarily large, we 
can assume n > r. Taking m large enough, so that p- m”! > gh) and m > 2p, 
we get: 
pillta) = p-m +a) <p-m™+p-m™ <m" = |E], 

which gives condition (1) in Lemma [3.2] Since t and r were arbitrary, this proves 
that C is not the reduct of an amalgamation class. 


It remains to prove the lemma. 


Proof of Lemma[3.4 Fix natural numbers r and t. In anticipation of the proof, let 
q be the maximum number of atomic types of (r+1)-tuples over any signature with 
at most t predicates of arity at most r, and let p = {log,(q)|. Suppose that C is a 
class of S-structures as in the lemma, with a diagram L : A > L, R: A —> R that 


witnesses its failure of amalgamation, and an (L, R)-confusion (J,€) satisfying 
condition from Lemma 3.2| 

Let Bt be a }t-structure with domain B and let f : A + B be a function from 
some set A to B. Define the pullback f*(B) as the =*-structure with universe A 
in which the interpretation of a relation symbol R of £* of arity k is f~1(R®), i.e., 
the inverse image of the interpretation of R in B under the mapping f : A’ > B®. 
By definition, f*(B) is the unique “*-structure on A for which f is a strong 
homomorphism into B. 

By definition of the structure J©, there is a distinguished embedding of J 
into J°. Therefore, by composition, any embedding m : A > J in € defines an 
embedding of A into J°, denoted 7: A > J°. Note that for any expansion Jt of 
JF, the pullback #*(J*) is an expansion of A, which is isomorphic (via 7) to an 
induced substructure of J*. 

Claim 1 There is a coloring C : E > {L, R} such that, for every expansion Jt 
of J? over the signature &*+, there are two embeddings 7 ando in E such that the 
pullbacks 7*(J*) and &* (JF) are equal, but C (T) 4 C(c). 

We show how the claim yields the lemma. Figure |4| illustrates the proof. 

Assume that C is the class of S-reducts of a class of ©*-structure C*. To 
reach a contradiction, suppose that Ct is closed under induced substructures and 
amalgamation. Let C be as in the claim. Since J? belongs to C by the definition 
of confusion, there exists an expansion J* of J? in C+. Let m and ø be as in the 
conclusion of the claim, and suppose without loss of generality that C(t) = L and 
C(c) = R. By the definition of J], the embeddings 7: A —> J and L : A > L 
induce embeddings 7, 7’, f, such that the diagram to the left below commutes: 


Let L = f*(J*) be the pullback structure; this structure is an expansion of L. 
Moreover, L" belongs to the class C*, since it is a pullback under an injective 
mapping, and hence Lt is isomorphic to an induced substructure L} of J+, which 
is in C*. 

Similarly, the embeddings o : A —> J and R : A — R induce embeddings 
G,o’,g such that the diagram to the right above commutes. Let R? = g*(J*) 
be the pullback structure, which is an expansion of R, isomorphic to an induced 
substructure Rt of Jt, hence belongs to the class CT. 
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FIGURE 4: Two embeddings 7,0 € E with C(x) = L and C(o) = R, in the context 
of the running example. The colored arrows depict various predicates of the stipulated 
signature X extending © (in general, they don’t need to be binary). The fact that the 
sequences of colors along T and along o are the same corresponds to the assumption that 
the pullbacks %*(J*) and 6*(J*) are equal. Therefore, the marked substructures Lt and 
Rt of J* (which correspond to L* and Rt in the proof via the mappings f and g) have 
an isomorphic substructure (isomorphic to A* in the proof). An amalgamation in C* of 
L} and RE along this substructure would yield as a S-reduct an amalgamation in C of L 
and R along A, a contradiction. 


Let At be the pullback 7*(J*), which, by the claim, is the same as the pullback 
a*(J*). Note that by commutativity of the diagram to the left above, the pullback 
m*(J*) is the same as the pullback L*(L*). Similarly, o*(J*) is the same as 
R*(R*). In other words, L is an embedding of At into L*, and R is an embedding of 
At into Rt. Since Ct is closed under amalgamation, there exists an amalgamation 
of the diagram L : At > L* and R: At > R+, which consists of a structure Ut 
in Ct and two embeddings L’ : Lt + Ut and R’: Rt > Ut. Taking »-reducts, 
we obtain an amalgamation in C of L : A— Land R: A— R. But the pair of 
embeddings L and R were supposed to witness failure of amalgamation in C — a 
contradiction proving that C* cannot be closed under amalgamation. 

Next we show how to prove Claim[I]and hence Lemmaf3.2| Call any embedding 
in € a spot, and any restriction of a spot to an r-element subset of the domain of A 
a partial spot. For each coloring C of the spots, and each two spots 7 and a, define 
mt &c o if and only if C(a) = C(a). For each coloring D of the partial spots, and 
each two spots 7 and o, define 7 ~p ø if and only if D(z|x) = D(o|x) for every 
r-element subset X of the domain of A. Both are equivalence relations on spots. 
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Claim 2 There is a coloring C of the spots using two colors, such that for all 
colorings D of the partial spots using q colors, there is a pair of spots t and o such 
that r ~po but a Æc. 


Proof. For this proof, let n be the cardinality of the domain of A and assume 
without loss that the domain of A is [n] = {1,...,n}. Let N be the number of 
spots and let M be the number of partial spots. With this notation, condition (1) 
reads as follows: 

N>p-M4+q(). (2) 
Color the spots independently at random with either L or R, each with probabil- 
ity 1/2. Let C be the random variable describing this process. In particular, C is 
a random variable taking as values strings of length N over alphabet {L, R}, each 
with the same probability. Thus the binary entropy h(C) of the random variable 
C is equal to N. 

Suppose for contradiction that the opposite of what the claim states holds. 
Then there is a random variable D taking as values colorings of the partial spots 
using q colors such that the inclusion ~p C &q holds with probability 1. The 
relation ~p has at most gh) equivalence classes; for each fixed spot 7, there are 
at most q choices of colors for each of the (”) restrictions 7|x to r-element subsets 
X C [n], and any two spots sharing these choices are equivalent. In particular, 


there is a random variable E taking as values strings of length q6) over alphabet 
{L, R} such that E and D determine C. That is, h(C | Æ, D) = 0, or equivalently, 


h(C, E, D) = ih, D). 
We will show that this is impossible by proving that 
h(E, D) < N = h(C) < h(C, E, D). 


Indeed, D is determined by (") random variables {Dx : X C [n], |X| = r}, where 
the random variable Dx takes as values the colorings of the restrictions of the 
spots to the subset X. If Mx denotes the number of such restrictions, the random 
variable Dx takes values in [g|“*, and therefore 


h(Dx) < log(q”*) = log.(q)- Mx < p- Mx. 


Noting that M is the sum of My as X ranges over all r-element subsets of [n], it 
follows that 


h(D) < X h(Dx) < Sop: Mx =p: Y Mx =p- M. 
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Moreover h(E) < gh) since E takes as values strings of length gh) over alphabet 
{L, R}. Hence 


h(E, D) < h(E) +h(D) < gh) +p- M. 


However states that this quantity is strictly smaller than N, as required. 


Finally we use Claim [2] to prove Claim [I] Let C be the coloring of Claim 2] with 
the two colors interpreted as the embeddings L : A > Land R: A —> R. For each 
expansion J* of J, let D be the coloring of partial spots defined as follows: for 
each spot 7 and each r-element subset X = {i} <... < ip} of the domain of A, let 
D(m|x) be the atomic type of (m(71),...,7(i,)) in Jt. This is a coloring of partial 
spots using at most q colors. By Claim P| there is a pair of spots 7 and a such that 
T~po but t %c o. From 7 ~p o and the fact that r is at least as large as the 
maximum arity of any new predicate in ©*, it follows that the pullbacks 7*(JT) 
and 6*(J*) are equal. On the other hand, from 7 %c o we get C(t) 4 C(o) by 
definition. This proves Claim [I]and Lemma B.2| 


4 Classes of consistent structures 


In this section we work out the first of our two examples of non-homogenizable 
classes. We start by defining some basic notions from the theory of constraint 
satisfaction problems as described, for example, in Chapter 6 of the monograph 
[12]. Recall that, for a structure T, we write CSP(T) for the class of all finite 
structures l over the same signature as T for which there is a homomorphism from 
l to T. The I’s are called instances, the T’s are called templates. 


4.1 Local consistency 


Let X be a relational signature, let A and B be »-structures, and let k and l be 
integers such that 1 < k < l. A (k,1)-consistent family on A and B is a non-empty 
family F of partial homomorphisms from A to B, such that the following three 
conditions hold for each f in F: 


1. |Dom(f)] <L, 

2. if X is a subset of A, then f|x is in F, 

3. if |Dom(f)| < k and X is a subset of A such that Dom(f) C X and |X| < l, 
then there exists g in F such that Dom(g) = X and f C g. 


If there is a (k,l)-consistent family on A and B, then we say that A is (k,1)- 
consistent with respect to B. Note for later use that the class of structures that 
are (k,/)-consistent with respect to B is closed under inverse homomorphisms: if 
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there is a homomorphism from A’ to A, and A is (k,/)-consistent with respect to 
B, then A’ is also (k,/)-consistent with respect to B. To see this, it suffices to 
compose the homomorphism from A’ to A with each partial homomorphism in the 
(k,1)-consistent family for A to get a (k,/)-consistent family for A’. 

We describe the special case of (2,3)-consistency in terms of a pebble game. 
The game is played between spoiler and duplicator, each having three pebbles, 
numbered 1, 2 and 3. Spoiler can place his pebbles on the nodes of A, while 
duplicator can place his pebbles on the nodes of B. They can also keep the pebbles 
in their pockets, in which they have all pebbles at the beginning of the game. The 
game proceeds in rounds as follows. In each round, spoiler places some of the 
pebbles from his pocket on the nodes of A and duplicator replies by placing his 
corresponding pebbles on the nodes of B. If the partial mapping defined by the 
pebble placement is not a partial homomorphism from A to B, then duplicator loses. 
Otherwise, spoiler puts back some of the pebbles into his pocket, and duplicator 
removes the corresponding pebbles, and the game continues to the next round. 
It is not hard to see that A is (2,3)-consistent with respect to B if and only if 
duplicator can avoid losing forever. 


4.2 Systems of linear equations over Fə 


We define a finite template Tə that can be used to represent the solvability of 
systems of linear equations over the 2-element field. Let us note that our definition 
of the template Tə will not be the standard one as it can be found, for example, in 
the original Feder-Vardi paper [11]. The main difference is that we want to have 
a signature of smallest possible arity, in this case two. We achieve this by letting 
T be the natural encoding of the standard template as its incidence structure. 
Concretely, Tə is defined as follows. Its domain is D U R, where 


D = {0,1}, 
R= {(2,y,z) € D? : x+y +z = 0 mod 2}. 
The elements of D are called values, and those of R are called triples. The signature 
X includes three partial functions 7, 72, and 73 that map triples in R to values in 
D, and four unary relations value, triple, Co and C1. Formally, in order to have a 
relational structure, Tz has binary relations that correspond to the graphs of the 
partial functions 7, 72 and 73. The interpretations of the symbols in Tə are as 
follows: 
1. T1, T2 and 73 map (z,y,z) in R to z, y and z, respectively, 
2. value holds of all elements in D, 


3. triple holds of all elements in R, 
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4. Co holds of 0 in D, and 
5. Cı holds of 1 in D. 


The purpose of triple is to encode equations of the type x +y +z = 0 mod 2, and 
the purposes of Co and C4 are to encode equations of the type x = 0 and x = 1, 
respectively. Note that even though the language does not allow writing more 
complicated equations, such as z + y + z = 1 mod 20rw+a2+y+z=0 mod 2, 
such equations can be simulated in the language of Tə with the help of auxiliary 
variables. 


Theorem 4.1 The class of all finite structures that are (2,3)-consistent with re- 
spect to Tz is not homogenizable. 


Proof. In the following, we fix the template T = Tz, and when we refer to con- 
sistency, we mean (2,3)-consistency with respect to T. Finite structures on the 
signature of T are called instances. Homomorphisms f : 1 — T from an instance 
1 to T are called solutions. By C denote the class of consistent instances. Ob- 
serve that, as noted earlier, the class of consistent instances is closed under inverse 
homomorphisms. 

The plan is to apply Theorem [3.1] to C, and for that we need to find a confusing 
diagram L: A> L, R : A —> R with arbitrarily large A. 

Let n > 8 be an exact power of two. Let t be a rooted, ordered tree with n 
leaves at depth log,(n); in particular, no node at depth 2 is a leaf, and no node at 
depth log,(n) —1 is a root. Let l be the instance obtained from t, with elements of 
two types: nodes, which correspond to the nodes of t, and triples, which correspond 
to triples (v, vo, v1), where v is an internal node in t, and vo and v are its left and 
right sons, respectively. Nodes are labeled by the unary predicate value and triples 
are labeled by the unary predicate triple. We say that the triple (v, vo, v1) is the 
triple below node v, and is adjacent to, or contains v, vo, and vı. For each such 
triple, we declare: 


father (v, vo, v1) = T1 (V, vo, U1) =v, 
left(v, V0, v1) = T2(v, Vo, vı) = V0, 


right(v, Vo, V1) = 73(¥, Vo, V1) = U1. 


We call a structure of this kind simply a tree. Since we will work with /-structures 
that are made of trees, for the sake of intuition from now on we use the names 
father, left, and right in place of 71, m2, and 73. If zi is a value in D, then the 
i-marking of J is the /-structure m;(1) obtained from I by marking the root by 
the predicate C;. Observe that in any solution v : m;(l) > T of m;(1), the sum of 
the values of the leaves is equal to 7 modulo 2. Conversely, any mapping from the 
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leaves of l to T such that the sum of the values of the leaves is equal to 7 modulo 2 
extends uniquely to a solution v : m;(1) > T. 

The structures L and R are the markings mo(1) and m (1) of the tree 1, respec- 
tively. The structure A is the substructure of | induced by the leaves of the tree. 
Note that A consists of n isolated points, labeled by the unary relation value. The 
unary relations triple, Co and C4, as well as the binary relations 7, 72, and 73, are 
empty in A. Note that L and R share A as an induced substructure. Let L : A —> L 
and R : A— R be the corresponding embeddings. 


Lemma 4.2 The free amalgam of L and R through L and R is inconsistent. 


Proof. When spoiler has only two pebbles on the board, we allow him to perform 
a move we call a slide, in which he moves one pebble from a node v to a triple 
adjacent to it, or from a triple to a node belonging to this triple. Duplicator 
has to respond accordingly: if spoiler slides a pebble from a node v to a triple t 
containing v on the i-th coordinate, then duplicator must move his corresponding 
pebble from a value v in D to a triple in R containing v on the i-th coordinate. 
Symmetrically, in the case when spoiler slides his pebble from a triple to the node 
in the i-th coordinate, duplicator must move his corresponding pebble from the 
corresponding triple to the value on its i-th coordinate. The slide moves can be 
simulated in the original game, using a third pebble. 

Denote the two (overlapping) trees lz and Ip, respectively; they have common 
leaves in the free amalgam L Ua R. Here is the strategy for spoiler; it consists of 
several steps. In the beginning of the n-th step, spoiler has two pebbles placed on 
corresponding nodes a and b of lz and Ip, at depth n — 1 of the tree. In particular, 
in the beginning of the first step, two pebbles are placed on the roots of I; and 
lg, respectively. For a node v on which spoiler has his pebble, denote by r(v) the 
value of the corresponding pebble of duplicator. The invariant is that r(a) 4 (0). 
This invariant is clearly satisfied in the beginning of the first step, since lz has its 
root labeled with Co and Ip has its root labeled with C4. 

In the n-th step, spoiler slides his pebble from node a to the triple a’ below 
a in lz, and then slides his pebble from node b to the triple b below 6 in Ip. 
Duplicator’s responses have to satisfy r(father(a’)) = r(a) and r(father(b')) = 
r(b). In particular, r(father(a’)) 4 r(father(b’')), by the invariant. It follows that 
left(r(a’))+right(r(a’)) F left(r(b'))+ right(r(b’)), so either left(r(a’')) A left(r(b’)) 
or right(r(a')) A right(r(b')) (or both). Since the cases are symmetric, suppose 
without loss of generality that the first case occurs. Then spoiler slides the pebble 
from a’ to left(a’) and then slides the pebble from b’ to right(b’), and continues the 
game from these two nodes playing the role of a and b. The invariant is satisfied. 

Since in each step the depth of a increases by 1, at some point, a must be a 
leaf of lz, and b is the corresponding leaf in Iz. But then a and b are the same 
element in L Ua R, and by the invariant r(a) # r(b). In other words, spoiler has 
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two pebbles placed at the same node of L Ua R, but the corresponding pebbles of 
duplicator are not placed on the same element of T. So duplicator loses. 


Lemma 4.3 Every amalgam of L and R through L and R is inconsistent. 


Proof. This follows at once from the previous lemma and the fact that C is closed 
under inverse homomorphisms. Indeed, the free amalgam L Ua R through L and 
R maps homomorphically to any amalgam of L and R through L and R. 


Let m be a natural number, and let J = A®m and E = Eam- 


Lemma 4.4 For every coloring C : E — {L, R}, the structure J? is consistent. 


Proof. We modify the game, by giving more power to spoiler. We show that even 
in this game, duplicator wins. In the modified game, the pebbles of spoiler can be 
placed only on triples of J], and the pebbles of duplicator can be placed only on 
triples of T. If the pebbles of spoiler are placed on triples a,,...,a,, with k < 3, 
then duplicator must have his corresponding pebbles placed on triples ¢;,...,¢, in 
T, so that the following conditions are satisfied: 


— Whenever a; is a triple containing a node with unary predicate Cj on some 
coordinate, then the same coordinate of t; is equal to j. 


— Whenever a; and a; agree on some coordinate, then t; and t; also agree on 
the same coordinate. 


We show how spoiler can copy a strategy which is winning in the original game to 
win in the modified game. 


Claim 3 If spoiler has a winning strategy in the original game, then he also has a 
winning strategy in the modified game. 


Proof. Suppose that in the original game spoiler places a pebble on a node v. We 
copy this move in the modified game by placing a pebble on any triple containing v 
on some coordinate, say, the i-th coordinate, and await the response of duplicator. 
If in the modified game duplicator places his corresponding pebble on a triple ¢ in 
T, then we pretend that the duplicator in the original game places his pebble on 
the i-th coordinate of t, and the game continues. At some point, duplicator loses 
in the original game. This means that one of two cases occurred in the original 
game: 


— Spoiler has placed a pebble on a node with unary predicate j and duplicator 
replied by placing his corresponding pebble on a value j’ with j’ Æ j. 
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— One pebble of spoiler is placed on a node v and another pebble of spoiler is 
placed on a triple t containing v on the i-th coordinate, and the correspond- 
ing pebbles of duplicator are placed on a value r(t) and a triple r(t) that, 
however, do not satisfy the condition that the i-th coordinate of r(v) equals 


r(t). 


Since duplicator is only copying his strategy from the modified game, it must be 
the case that duplicator must have lost as well in the modified game. In particular, 
if spoiler wins in the original game, then he wins in the modified game. 


We show a winning strategy for duplicator in the modified game on J]. By the 
claim above, this means that duplicator also has a winning strategy in the original 
game. 

The arena J? on which spoiler places his pebbles is a union of trees of the form 
1 glued along the leaves. Therefore, it is meaningful to talk about roots, children 
(or sons), brothers, and leaves, and parents in the case of nodes from trees which 
are not roots nor leaves (leaves have very many parents). Every triple in J] is of 
the form (v, vo, v1), where v is an internal node of some tree, and vo and v are its 
left and right son. 

Call two tree nodes v and w in J? congruent, and write v & w, if the following 
conditions hold: 


— The nodes correspond to the same node in I, 


— The leaves below v coincide with the leaves below w. 


We lift this notion to triples: two triples (v, vo, v1) and (w, wo, w1) are congruent, 
also written (v, vo, v1) = (w, wo, w1), if v = w, vo = wo, and vı = w1. Observe 
that two distinct roots in J] are not congruent, since by construction, not all their 
leaves are identified. 

During the game, let a,,...,a,, with k < 3, denote the triples on which the 
pebbles of spoiler are placed. Let X denote the set of nodes that are congruent to 
some component of some pebbled triple, and let X’ denote the union of X with 
the roots. We say that a function f : X’ > D is nice if it satisfies the following 
conditions. 


1. For every triple (x,y,z) in J°, if x,y,z € X’, then f(x) + f(y) + f(z) =0. 
2. For every root r, if r is marked with unary predicate C;, then f(r) = i. 


3. Whenever x,y € X’ are congruent, then f(x) = f(y). 


We show that duplicator has a strategy which satisfies the following invariant at 
each moment of the game: 
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There is a nice function f : X’ — D such that for each pebble of spoiler 
occupying a triple (x,y,z), duplicator’s corresponding pebble occupies 


the triple (f(x), f(y), F(z). 


At the beginning of the game, the invariant is satisfied: since X’ consists only 
of roots, we can define f(x) = i for a root x with unary predicate C;, yielding a 
nice function — the last condition of nicety holds since no two distinct roots are 
congruent. 

Suppose that at some moment during the game there is a function f satisfying 
the above conditions, and spoiler performs a move. If in this move he removes a 
pebble from some triple, then duplicator responds by removing the corresponding 
pebble from T, and it is easy to see that the restriction of f to the resulting set 
X’ satisfies the above conditions. 

Suppose now that spoiler makes his move by placing a new pebble on the board. 
In particular, before the move he had k < 2 pebbles on triples aı,..., ap, and a 
new pebble is placed on the triple az, which we denote c for simplicity. Below, 
unless indicated, when we speak about X, X’, or f, we refer to their values just 
before spoiler placed the new pebble on c. The case that c is a triple (v, vo, v1) 
with v, vo,v1 € X’ is trivial: duplicator just responds with (f(v), f(vo), f(v1)). 
This response is not loosing thanks to the invariant and the first two conditions of 
the nicety of f. Moreover, the values of X and X’ after duplicator’s response are 
unmodified, so the same function f can be used in the invariant. From now on we 
assume that at least one of the coordinates of c is not in X’. 

Note that after spoiler’s move, the new X’ includes the congruence classes of 
the three components of c. We say that a triple is completed by spoiler’s move if not 
all three components of the triple are in X’ before spoiler’s move, but the addition 
of these congruence classes to X’ makes all three components of the triple belong 
to the new X’. In particular, c and its congruents are completed by spoiler’s move. 
The new f after spoiler’s move will be defined to extend the old f by assigning 
values to the components of c and its congruents in such a way that the conditions 
of nicety are satisfied for the new X’. We need to distinguish several cases: 

Case 1: cis a triple (v, vo, v1) in which vp and v; are not leaves, and v is already 
in X’. Let voo and vo; be the left and right sons of vp, and let vip and v11 be those 
of vı. We need the following claim: 


Claim 4 There exist values i, io, and 1, in D such that 
1. io = f (vo) if vo belongs to X", 
2. i, = f(v) if vı belongs to X’, 
3. i+ig ti, =0, where i= f(v), 
4. io + f(vo0) + f(vo1) = 0 tf vo and vo, belong to X’, 
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5. in + f(vio) + f(v) = 0 tf vio and vy, belong to X’. 


Proof. Since not all three v, vo and vı are in X’ but v is in X’, at most one among 
vo and vı is in X’. It follows that not all four voo, Vol, v10, and vı; can be in X”. 
To argue for this, note that at most two pebbles occupy at most two triples tı 
and t2 before spoiler’s move, but it cannot be the case that tı & (vo, voo, Voi) and 
to = (v1, v10, Vi1) if not both vo and vı are in X’. Moreover, for the same reason, 
if both voo and vo; are in X’, then vı is not in X’, and if both vio and v11 are in 
X’, then vp is not in X’. We use this to choose ig and 7, by cases. 

Case (i): both voo and vp; are in X’. First choose ig to satisfy condition 4 and 
then choose 7; to satisfy condition 3. Note that in case condition 7 also applies, 
then the only choice of 79 that makes condition 4 hold is guaranteed to satisfy 
condition 1 too by the first condition of nicety of f. Note also that in this case 
conditions 2 and 5 do not apply. 

Case (ii): both vio and vy, are in X’. First choose 7, to satisfy condition 5 and 
then choose ip to satisfy condition 3. Again, note that in case condition 2 also 
applies, then the only choice of 7; that makes condition 5 hold is guaranteed to 
satisfy condition 2 too by the first condition of nicety of f. Note also that in this 
case conditions 7 and 4 do not apply. 

Case (iii): otherwise. In this case the only conditions that can apply are 1, 2, 
and 3, and among / and 2 at most one can apply. In case / applies and vg is in 
X’, first choose ig to satisfy condition 7 and then choose 7; to satisfy condition 3. 
In case 2 applies and vı is in X’, first choose 7; to satisfy condition 2 and then 
choose ig to satisfy condition 3. 


Case 2: cis a triple (v, vo, v1) in which vp and v; are not leaves, and v is not 
yet in X’. Let voo and vo; be the left and right sons of vp, and let vio and v11 be 
those of vı. Since v is not in X’, it is not a root. Let then w be the sibbling of v, 
and let u be their parent. We need the following claim: 


Claim 5 There exist values i, io, and 1, in D such that 
1. io = f (vo) if vo belongs to X", 
2. i, = f(v) if vı belongs to X’, 
Sada, 
4. f(u) + f(w) +71 =0 if w andu belong to X’, 
5. io + f(vo0) + f(vo1) = 0 if vo and vo, belong to X’, 
6. i1 + f(vio) + f(v) = 0 if vio and vy, belong to X’. 


Proof. If both vo and vı are in X’, we argue that w and u are not in X’. To see 
this, note that at most two pebbles occupy at most two triples before spoiler’s 
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move. But if both vg and v; are in X’, then these triples must contain nodes that 
are congruent to vo and vı, and be different and hence different from any triple 
that contains a node congruent to u or w, since all triples that contain both vp and 
vı are congruent to c. Thus, in case both vp and vı are in X’, we choose ip = f (vo) 
and 7; = f(v,), and 7 to satisfy condition 3. Conditions 1, 2 and 3 are then true 
by construction, condition 4 does not apply, and conditions 5 and 6 hold because 
f is nice with respect to X’. 

Assume then that not both vo and vı are in X’. In such a case we argue that 
not all four voo, vor, Vio, and vj; can be in X’. To see this, note again that at 
most two pebbles occupy at most two triples tı and t2, and it cannot be that 
tı = (vo, voo, Vor) and te S (v1, Vio, v11) if not both vo and v; are in X’. Moreover, 
for the same reason, if both voo and vo; are in X’, then v; is not in X’, and if both 
Vio and v4, are in X’, then vo is not in X’. We use this to choose ig and 7; by 
cases. In all cases we first choose 2 to satisfy condition 4. 

Case (i): both voo and vp; are in X’. First choose ip to satisfy condition 5 and 
then choose 7; to satisfy condition 3. Note that in case condition 7 also applies, 
then the only choice of iọ that makes condition 5 hold is guaranteed to satisfy 
condition 1 too by the first condition of nicety of f. Note also that in this case 
conditions 2 and 6 do not apply. 

Case (ii): both vio and vy; are in X’. First choose i; to satisfy condition 6 and 
then choose ip to satisfy condition 3. Again, note that in case condition 2 also 
applies, then the only choice of 7, that makes condition 6 hold is guaranteed to 
satisfy condition 2 too by the first condition of nicety of f. Note also that in this 
case conditions 7 and 5 do not apply. 

Case (iii): otherwise. In this case the only conditions that can apply are 1, 2, 
and 3 (and 4), and among / and 2 at most one can apply. In case / applies 
and vg is in X’, first choose ig to satisfy condition 7 and then choose 72; to satisfy 
condition 3. In case 2 applies and v; is in X’, first choose 7; to satisfy condition 2 
and then choose io to satisfy condition 3. 


Case 3 (and last): cis a triple (v, vo, v1) in which vp and v; are leaves. Since v 
is not a root, let w be its sibling, and let u be their parent. 


Claim 6 There exist values i, io, and 1, in D such that 
1. i= f(v) ifv belongs to X’, 
2. io = f(vo) if vo belongs to X’, 
3. i, = f(v) if vı belongs to X’, 
4. i+ io +i =0, 
5. f(u) + f(w) +i =0 ifw andu belong to X', 
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Proof. As in the previous case, if both vg and vı are in X’, then w and u are not 
in X’, but the argument to show why this is the case is slightly different. First 
note that if both v9 and vı are in X’ then v is not in X’ because not all three 
components of c are in X’ by assumption. Second, at most two pebbles occupy 
at most two triples before spoiler’s move. If both vg and vı are in X’, then these 
triples must contain vp and v1, which are congruent only to themselves, and be 
different and hence different from any triple that contains a node congruent to u 
or w, since all the triples that contains both vp and vı are congruent to c. Thus, 
in case both vp and vı are in X’, we choose ig = f(vo) and 7; = f(v1), and i to 
satisfy condition 4. Conditions 7 and 5 just do not apply. 

Assume then that not both vo and v; are in X’. In such a case, first choose i to 
satisfy condition 5. Note that if condition / also applies, then the unique choice 
that satisfies 5 also satisfies 7 by the first condition of the nicety of f. Once i is 
chosen, choose either to or 7, to satisfy whichever condition among 2 or 3 applies, 
if any, and then choose the other to satisfy condition 4. 


This completes the cases analysis over c. Now, fix i, 79, and 7; as in the claim in 
whichever of the three cases applies. We claim that f can be extended to a function 
g that is defined on v, vo, and vı so that g(v) = i, g(vo) = io, and g(v1) = i1, and 
that is nice with respect to the new X’. Indeed, let Y, Yo, and Y, denote the sets 
of nodes that are congruent to v, v1, and v1, respectively. We define the extension 
g of f by setting g(x) = i for all x € Y, g(x) = io for all x € Yo, and g(x) = i for 
all x € Yı. By the choices of i, 7; and ts in the claims, and the third condition of 
nicety of f, this is well defined for those x on which f was already defined. Note 
that the domain of g is precisely the value of X’ after spoiler’s move. Let us argue 
that g is nice with respect to this new X’. 

First we note that on all triples that are congruent to (v, vo, v1), its three com- 
ponents get the same three values (g(v), 9(vo), g(v1)). This shows that g satisfies 
the third condition of nicety with respect to the new X’. The second condition of 
nicety is also satisfied since g extends f and f was nice with respect to the old X’, 
which contained all roots already. Finally, in order to argue that g satisfies the 
first condition of nicety we need to argue which triples are completed by spoiler’s 
move. The triple c and its congruents are definitely completed and, for these, the 
condition i + i9 + 7, = 0 from the claims guarantees the first condition of nicety. 
The addition of v, vo, and vı to X’ can complete the triples (u, v, w), (vo, voo, Vor); 
and (v1, v10, v11), When they exist, and their congruents, but no other triples. And 
for these, the conditions of the claims guarantee that the choices of i, i9, and 21 
satisfy the first condition of nicety. 


Lemma and Lemma [4.4] show that the diagram L : A > L, R : A > Ris 
confusing for the class of consistent structures. Since A can be taken arbitrarily 
large, Theorem [4.1] follows immediately from Theorem [3.1 
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4.3 Other finite Abelian groups 


The template Tə for systems of equations over the 2-element field can be generalized 
to all finite Abelian groups. Let G be a finite Abelian group; we write + for the 
group operation and 0 for its neutral element. Let Tg be the structure with domain 
DUR, where 


D=G, 
R= {(x,y,z) €E Do: e+y+z=01. 


The elements of D are called values, and those of R are called triples. As in Tə, the 
signature of Tg has three binary relations 7m1, 72, and 73, two unary relations value 
and triple, and one unary relation Ca for each value a in D. The interpretations 
of all relation symbols are as in T; in particular, the unary relation symbol Ca is 
interpreted by the singleton set {a}. It is straightforward to check that Tg can be 
used to encode arbitrary systems of equations over G. As in the 2-element field 
case, equations more complex than the basic x+y+z = 0 or x = a can be encoded 
with the help of auxiliary variables. 

Theorem 4.5 If G is a finite Abelian group with at least two elements, then the 
class of all finite structures that are (2,3)-consistent with respect to Tg is not 
homogenizable. 


Proof. The proof of Theorem does not rely in any way on the fact that the 
group is addition mod 2, except for it being Abelian and having at least two 
different values in it. 


It is known that, for any non-trivial finite Abelian group, the constraint satis- 
faction problem of the template Tg has unbounded width, i.e. for every two natural 
numbers k and / there exist instances | that do not have homomorphisms to Tg, 
but are nonetheless (k,/)-consistent with respect to Tg. We also say that Tg does 
not have (k,/)-width for any k and l. This was proved by Feder and Vardi 
for the standard template for linear equations mod 2, and later alternative proofs 
generalize quite well to the template Tg (see, for instance, [I]). Moreover, the 
solution to the Bounded-Width Conjecture of Barto and Kozik [4] implies that all 
cases of templates of unbounded width are explained by the unbounded width of 
some Tg. Technically: 


Theorem 4.6 ([4], see also Theorem 4.1 in [8]) Let T be a core finite relational 
structure. If T does not have bounded width, then T pp-interprets Tg for some 
non-trivial finite Abelian group G. Moreover, if the signature of T has maximum 
arity at most two, then the conclusion holds even if T does not have (2,3)-width. 


Thus, the templates Tg are in a strict formal sense the canonical templates of 
unbounded width. Theorem states that, for all such templates, their class of 
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locally consistent instances is non-homogenizable. Interestingly, the converse to 
this is also true in a strong sense: for all templates that do have bounded width, 
their class of locally consistent instances is homogenizable. This follows quite di- 
rectly from the fact that, for every finite template T, the class of instances | that 
have a homomorphism to T is homogenized by expanding them by all their homo- 
morphisms to T. When these two observations are put together, we get that, up to 
the relation of pp-interpretability between templates, which is known to preserve 
the property of having bounded width, the templates that have bounded width 
are distinguished from those that do not by the fact that their classes of locally 
consistent instances are homogenizable. It seems plausible that our Theorem 
could be adapted to show that all templates of unbounded width give themselves a 
non-homogenizable class of locally-consistent instances, without the need to resort 
to pp-interpretability, but this remains open. 


5 Classes defined by forbidden homomorphisms 


The positive result of Hubička and Nešetřil [14] shows that if F is an HN-regular 
class of finite connected structures, then the class Forb,(F) is a reduct of an amal- 
gamation class. HN-regularity is a notion reminiscent of the notion of regularity of 
word languages or of tree languages. Indeed, in the case of structures of treewidth 
one, HN-regularity and regularity in the sense of tree automata both correspond 
to MSO-definability. In this section we give an example of a non-homogenizable 
class of finite structures that is of the form Forb;(G), where G is an MSO-definable 
class of connected finite structures of treewidth two. It follows from the result of 
Hubička and Nešetřil that this is optimal. Recall that the treewidth of a finite 
structure is defined as the treewidth of its Gaifman graph, and pathwidth is a 
restriction of treewidth (for definitions see, for example, [8]). 


5.1 Pathwidth three 


Consider the class F from the running example in Section [3 This was shown non- 
homogenizable in Example The class F can be defined by an MSO sentence, 
which expresses that there are exactly four colored points, which are colored R, B, 
S, and T, respectively, and the rest of points form a directed simple E-path from 
S to T with all vertices along the path connected by an undirected E-edge to both 
Rand B. Moreover, each structure F in F is connected and has pathwidth three: 
take the path-decomposition of the E-path with bags of size 2 and add both red and 
blue vertices to each bag. This gives a path-decomposition with bags of size 4, so its 
pathwidth is 3 (thanks to the —1 in the definition of treewidth/pathwidth). Now 
we show how to modify the class F to obtain a class of structures of treewidth two. 
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5.2 Treewidth two 


Consider a rooted, directed binary tree, in which every node is either a leaf, or an 
inner node with two sons, in which case it has a directed Ep-edge to its left son and 
a directed E, -edge to its right son. Color its root red, by labeling it with the unary 
predicate R, and create an extra blue vertex, with unary predicate B, connected to 
all the leaves of the tree by an undirected E-edge. An example of such a structure, 
obtained from a full binary tree of depth 4, is depicted in Figure|5| Let G denote 
the class of all structures obtained in this way. The signature X of these structures 


FIGURE 5: Forbidden structure G, with 16 leaves. 


consists of three binary predicates E, Eo, and E, for the edges, and two unary 
predicates R and B, each appearing in the structure exactly once as indicated. 
It is straightforward to check that G is MSO-definable on the class of all finite 
structures. Moreover, each structure G in G is connected and has treewidth two: 
just take a tree-decomposition of the binary tree and add the blue point to all its 
bags. 


Claim 7 The class G forms an antichain in the homomorphism preorder. 


Proof. Suppose that h : G1 —> C» is a homomorphism of two structures in G. Then 
h must map the root of G4 to the root of Cə (since only the root is colored red), and 
must map the leaves of G; to the leaves of G (since only the leaves are adjacent to 
a blue node). Finally, a vertex v in G; reached from the root by a path with labels 
iiio... ip € {0,1}* must be mapped to the unique vertex w of G, reached from the 
root by the path obtained by reading the same labels. Hence, the mapping f is 
injective. Since no inner node of the tree can be mapped to a leaf, f must also be 
surjective. It follows that h is an isomorphism. 


Proposition 5.1 The class Forb,(G) is not homogenizable. 
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Proof. We apply Theorem |3.1} To this end, choose an arbitrary structure G € G, 
and consider the diagram L : A > L, R : A —> R defined as follows. L is the left 
part of the structure G, obtained by removing the blue vertex (labeled B), R is the 
right part of the structure G, obtained by keeping the blue vertex and the nodes 
adjacent to it, A is the intersection of L and R, i.e., the substructure of G induced 
by the leaves of the underlying binary tree. Let L : A > L and R: A —> R be 
the two inclusions. It is clear that every amalgamation of L,R must contain a 
homomorphic image of O, so L, R witnesses failure of amalgamation of Forb;,(G). 
Let m be an arbitrary number, J = A 8 m, E = Eam, and C: E > {L, R} be any 
coloring. 


Claim 8 The structure J? does not contain a homomorphic image of any structure 


in G. 


Proof. Assume that h : G’ — J° is a homomorphism and GC’ € G. Then the root 
v of G’ is mapped to some red vertex h(v) in J, let m : A — J be the embedding 
corresponding to the vertex h(v). In particular, C(7) = L. Let f be the embedding 
of L into J? induced by the embedding 7 : A > J and L: A > L. By the same 
argument as in the proof of Claim [7] h must map the nodes of G’ injectively into 
the structure f(L). Moreover, h maps the leaves of G’ into elements of J C JF 
(since only those vertices may be adjacent to a blue node). As in the proof of 
Claim [7] it follows that h is a bijection from G’ without the blue node to the image 
of f. In particular, the leaves in G’ are mapped bijectively to the image of m. But 
since C(7) = L, it is impossible that all the nodes in the image of m are adjacent 
to a common blue vertex. 


Hence, the diagram L, R is confusing for C. Since G can be chosen so that A is 
arbitrarily large, the conclusion follows from Theorem |3.1 


5.3 Optimality 


We argued already that the classes F from Example and G from Section 
are MSO-definable. Therefore, the set of colored paths that represent the path- 
decompositions of the structures in F is regular in the automata-theoretic sense, 
and the set of colored trees that represent the tree-decompositions of the struc- 
tures in G is regular in the tree-automata-theoretic sense (see [8]). It is interesting 
to check why F and G are not regular classes of structures in the sense of Defi- 
nition 2.3 of Hubitka-NeSetril [14]. By Example [3.2] and Proposition [5.1] we know 
that F and G cannot be HN-regular as otherwise Forb;,(F) and Forb,(G) would 
be homogenizable by Theorem 3.1 in [4]. 

In order to check that a class is not HN-regular it suffices to identify minimal 
g-separating g-cuts of unbounded sizes in its structures. For F, note that the set 
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of all vertices in the E-path is a minimal g-separating g-cut in Fẹ, and its size 
is k and hence unbounded. For G, the set of all leaves in the binary tree in any 
structure G in G is a minimal g-separating g-cut, and its size is also unbounded 
since all trees are represented in G. 

To close this section we note that every MSO-definable class of finite con- 
nected structures of treewidth one is HN-regular. This follows from the fact noted 
earlier that, for colored trees, HN-regularity, tree-automata regularity, and MSO- 
definability are equivalent. In particular, by Theorem 3.1 in [14], every class of the 
form Forb;(F), where F is an MSO-definable class of connected finite structures 
of treewidth at most one, is homogenizable. 


6 Conclusion 


We study homogenizability — a combinatorial notion useful in computer science (see 
e.g. [5], [6], [7]). Our main contribution is a necessary condition for homogenizability 
of a class of finite structures. We apply it to prove non-homogenizability of a 
class related to constraint satisfaction problems, consisting of locally consistent 
instances with respect to the template for linear equations over a finite Abelian 
group, and a class defined by forbidding homomorphisms from an MSO-definable 
class of structures of treewidth two, which is tight by the positive result of [14]. 

Our original motivation for studying the homogenizability of classes of CSP 
instances came from an approach, first outlined in [2], to characterize the finite 
templates that are solvable by any sound consistency algorithm. This was applied 
in [2] to get (yet) a(nother) criterion to decide solvability by the arc-consistency 
algorithm, and it was asked if this kind of technology could also work for (k,/)- 
consistency. While our negative results of Section |4]rule out the direct applicability 
of this method to (k, /)-consistency, perhaps an indirect method could still work for 
these or other consistency algorithms. For example, perhaps one could first decide 
if the class of consistent instances with respect to the template T is homogenizable, 
and apply the method from [2] only in the relevant case that it is. 

This raises the very interesting question of deciding whether a finitely presented 
class of finite structures is homogenizable, and for that we need conditions that 
are both necessary and sufficient. Significant steps in that direction were taken in 
the Hubitka-NeSetril paper [14] for classes of the form Forbą (F). Perhaps special 
cases that are still enough for the method to work are easier. Can we characterize 
the classes of the form Forb, (F), with MSO-definable F say, that are reducts of 
classes that are closed under induced substructures and free amalgamation? Such 
classes we call freely homogenizable. 
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